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SYSTEM AND METHOD FOR PROVIDING BLACK WHITE 
IMAGE COMPRESSION 

BACKGROUND OF THE INVENTION 

1. Technical Field 

The present invention relates generally to image compression, and more 
specifically relates to a system and method for providing black white image data 
compression. 

2. Related Art 

Since the advent of the modern information technology age, systems have been 
developed to store, process, or communicate black white image data retrieved from 
printed documents. Common examples include, for instance, facsimile machines, 
copiers, scanners, etc. In many instances, the system is required to archive, store or 
transmit the black and white image data. To improve performance, the image data is 
compressed, thereby reducing storage, processing and bandwidth requirements. 

Black white image data compression techniques are well known in the art. Due to 
the universal need for encoding and decoding (namely compression and decompression) 
of image data, the industry has adopted standards, which are widely used today. COTT- 
G4, for example, is one of the most popular standards, finding its most extensive use in 
facsimile machines. Typical compression ratios using COTT-G4 are on the order of 10 
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- 15x. Ultimately, the amount of compression is a function of the black white image 
information/data content. 

In the case of most black white compression standards, such as CCITT-G4, the 
compression process, which is engineered to be an encoding process, is lossless, i.e., 
following the decompression process, all information present in the original image is 
fully recovered. Thus, limitations exist with respect to the amount of compression that 
can be achieved. 

However, numerous present day industries are increasingly required to archive 
vast amounts of information in electronic form. Examples include the finance industry 
saving check images, the insurance industry saving documents, the health care industry 
saving medical records, the legal industry, federal and state governments, etc. 
Accordingly, data compression for black white images remains of vital interest. 

SUMMARY OF THE INVENTION 

The present invention addresses the above-mentioned problems by providing a 
methodology to obtain improved bitonal (e.g., black white) image compression, while 
maintaining compatibility with industry standard encoding and decoding techniques. In a 
first aspect, the invention provides a compression system for compressing a bitonal 
image, comprising: a scaling system for geometrically scaling the bitonal image, wherein 
the scaling system reduces contiguous sets of original pixels down to a smaller set of 
scaled pixels; and an encoding system that encodes the scaled bitonal image using an 
industry standard technique. 
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In a second aspect, the invention provides a method for compressing a bitonal 
image, comprising: geometrically scaling the bitonal image; and encoding the scaled 
bitonal image using an industry standard technique. 

In a third aspect, the invention provides a program product stored on a recordable 
medium for processing a bitonal image, comprising a scaling system having: means for 
selecting a pair of pixels from the original bitonal image; means for forming a four digit 
binary value made up of values from the pair of pixels and two flanking pixels; means for 
calculating a single binary pixel value from the four digit binary value; and means for 
generating a scaled bitonal image in which the pair of pixels is replaced with the single 
binary pixel value. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features of this invention will be more readily understood from 
the following detailed description of the various aspects of the invention taken in 
conjunction with the accompanying drawings in which: 

Figure 1 depicts a bitonal compression system in accordance with the present 
invention. 

Figure 2 depicts a scaling operation in accordance with the present invention. 
Figure 3 depicts a table used in the scaling operation depicted in Figure 2. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a methodology for compressing images, beyond 
the capabilities of standardized encoding technologies, such as that provided by CCITT- 
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G4. While the embodiments described herein are described with reference to black white 
images, it should be recognized that the scope of the invention may be applied any type 
of bitonal image. Moreover, while the invention is described with reference to a CCITT- 
G4 encoding system, it is understood that any known bitonal encoding technique could be 
utilized. Examples include ABIC, JBIG, etc. 

The present invention recognizes that two attributes of a bitonal (e.g., black 
white) image are responsible for making the image legible and readable. The first 
important attribute involves transitions. As a black white image is scanned (similar in 
manner as a CRT scans a screen), one notices the presence of transitions, both from white 
to black and black to white. It is this aspect that gives the image contrast, readily noticed 
by the human eye. When properly placed transitions occur on multiple scan lines, legible 
and readable information begins to appear. 

The second important attribute involves the very small (i.e., single) isolated 
features, namely, single black pixels between white pixels or single white pixels between 
black pixels. 

These isolated features provide richness and sharpness to an image. As long as 
the scaling system is able to preserve these two attributes, and substantially maintain their 
relative geographic locations, a transformed image will offer near identical informational 
content. 

Typical black white images (such as checks and other printed documents) have 
approximately 10% of their pixel space represented by black pixels. Since many of these 
black pixels will invariably be clustered together, a few transition points (relative to the 
entire pixel space) will be encountered when the image is scanned. Similarly, a typical 



CHA920030031US1 



4 



black white image will also contain occasional isolated features. For example, there may 
be areas contained in the image where a single black pixel is sandwiched between white 
pixels, or where a single white pixel is sandwiched between black pixels. 

A fundamental feature of the invention is to geometrically scale (referred to 
herein as "smartscaling") the image to reduce the pixel count while maintaining the two 
aforementioned attributes. In the exemplary embodiment described below, the pixel 
count is reduced by 2x in both the X and Y direction. However, it should be understood 
that any pixel reduction that maintains the two attributes could be utilized. Accordingly, 
smartscaling refers to any bitonal pixel reduction in which transition and isolated 
attributes are preserved, and the attributes' relative geographic positions are substantially 
maintained. 

Referring now to Figure 1, a compression system 10 is shown that receives a 
black white image 18 and outputs compressed image data 20, which can be, e.g., 
archived, transmitted, processed, etc. The black white image 18 is first submitted to 
scaling system 12. Scaling system 12 "smartscales" the image, resulting in a pixel 
reduction in which transition and isolated attributes are preserved, and the attributes' 
relative geographic positions are substantially maintained. Although not required, other 
filtering and processing operations could also be applied to further improve the quality of 
the smartscaled image 22. The smartscaled image 22 is then submitted to a COTT-G4 
encoding system 14, which performs an industry standard encoding operation (commonly 
used in facsimile operations and the like). Note that other industry standard encoding 
system could likewise be used. 
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Once encoded, the smartscaled image 22 can be retrieved by decoding the 
compressed image data 20 with an industry standard CCITT-G4 decoder. The scaled 
black white image can then be manipulated as needed (e.g., displaying, printing, 
zooming, expanding) using know techniques and systems with aliasing corrections. 

Using the compression system 10 described above, a size reduction of about 45% 
can be achieved over the compression achieved by a stand-alone COTT-G4 encoding 
system 14. Note that a smartscaled image 22 may have limited, localized, geometric 
distortion. For instance, a single pixel may sometimes be displaced by one pixel position. 
However, this distortion is only evident when viewed under high magnification. 
Accordingly, for most all applications (such as E13B OCR readability), any distortion 
will not diminish or affect the information content being presented in the image. 

As noted above, scaling system 12 provides a pixel reduction in which transition 
and isolated attributes are preserved, and the attributes' approximate relative geographic 
positions are substantially maintained. Figures 2-3 depict an exemplary smartscaling 
embodiment in which the number of pixels is reduced by a factor of two. Specifically, as 
shown in Figure 2, a first partial row 30 of pixels pO, pi ... p8 are shown prior to scaling, 
and a second partial row 32 of pixels PI, P2, P3, are shown after scaling. Each pixel of 
the black white (or any bitonal) image has a value of either 1 or 0, where for example 1 
represents white and 0 represents black. In this example, pixels pl-p6 are scaled down to 
P1-P3, i.e., contiguous sets of pixel pairs pl-p2, p3-p4, and p5- p6 are scaled to PI, P2, 
and P3, respectively. 

In an exemplary embodiment, scaling is achieved using an algorithm involving 
the table shown in Figure 3. Specifically, when reducing two original pixels down to one 
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scaled pixel, the algorithm must decide whether to assign the scaled pixel a value of 1 or 
0. The algorithm decides this by examining the two original pixels and the two flanking 
pixels that neighbor the two original pixels. Accordingly, when scaling pixels pi and p2, 
the algorithm examines pixel values for pO, pi, p2 and p3, in this case 0010. The four 
values provide the input into the table shown in Figure 3. In this case, the input 0010 
refers to index number 02, and yields an output value of 1. Accordingly, the scaled value 
for the original pixel pair pi, p2 is 1. 

Similarly, the scaled value P2 for the original pixel pair p3, p4 is determined by 
examining the four values of pixels p2-p5 (i.e., 101 1). In this case, the input 101 1 refers 
to index number 11, and yields an output value of 0. The scaled value P3 for the original 
pixel pair p5, p6 is determined by examining the four values of pixels p4-p7 (i.e., 1 100). 
In this case, the input 1 100 refers to index number 12, and also yields an output value of 
0. 

In the table shown in Figure 3, four pixels are shown in the input column and the 
index number is their decimal representation. It is the two central pixels of the input 
column that are replaced by the output. Output values are determined based on the four 
digit binary input value such that transition and isolated attributes are preserved in the 
scaled image, and the attributes' relative geographic positions are substantially 
maintained. For instance, it can be seen that "0 01 0" has an isolated feature (i.e., a 1 
sandwiched between zeros). Accordingly, this results in an output value of 1. The input 
"1 10 0" has a transition from l's to 0's. Accordingly, the output value is 0 to ensure that 
the transition is maintained. 
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The table depicted in Figure 3 provides a 2x image reduction. Obviously, various 
alternative embodiments (e.g., examining a six digit binary input number, providing a 3x 
image reduction, etc.) could be implemented to achieve similar results. In a typical 
embodiment, the image may be first scaled in one direction, e.g., horizontally, and then 
be scaled in the other direction, e.g., vertically. 

It is understood that the systems, functions, mechanisms, methods, and modules 
described herein can be implemented in hardware, software, or a combination of 
hardware and software. They may be implemented by any type of computer system or 
other apparatus adapted for carrying out the methods described herein. A typical 
combination of hardware and software could be a general-purpose computer system with 
a computer program that, when loaded and executed, controls the computer system such 
that it carries out the methods described herein. Alternatively, a specific use computer, 
containing specialized hardware for carrying out one or more of the functional tasks of 
the invention could be utilized. The present invention can also be embedded in a 
computer program product, which comprises all the features enabling the implementation 
of the methods and functions described herein, and which - when loaded in a computer 
system - is able to carry out these methods and functions. Computer program, software 
program, program, program product, or software, in the present context mean any 
expression, in any language, code or notation, of a set of instructions intended to cause a 
system having an information processing capability to perform a particular function 
either directly or after either or both of the following: (a) conversion to another language, 
code or notation; and/or (b) reproduction in a different material form. 
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The foregoing description of the preferred embodiments of the invention has been 
presented for purposes of illustration and description. They are not intended to be 
exhaustive or to limit the invention to the precise form disclosed, and obviously many 
modifications and variations are possible in light of the above teachings. Such 
modifications and variations that are apparent to a person skilled in the art are intended to 
be included within the scope of this invention as defined by the accompanying claims. 
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